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["T^ ■ Abstract 

In this note we show that the locally stationary wavelet process can be decomposed 
"jrt I into a sum of signals, each of which following a moving average process with time- varying 

C/3 , parameters. We then show that such moving average processes are equivalent to state 

space models with stochastic design components. Using a simple simulation step, we 
propose a heuristic method of estimating the above state space models and then we apply 
C^ I the methodology to foreign exchange rates data. 

Some key words: wavelets, Haar, locally stationary process, time series, state space, 
CO ' Kalman filter. 
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Q ; 1 Introduction 

> '■ 

■ '-j . Nason et al. (2000) define a class of locally stationary time series making use of non-decimated 

^ ■ wavelets. Let {yt) be a scalar time series, which is assumed to be locally stationary, or 



stationary over ceratin intervals of time (regimes), but overall non-stationary. For more 
details on local stationarity the reader is referred to Dahlhaus (1997), Nason et el. (2000), 
Francq and Zakoan (2001), and Mercurio and Spokoiny (2004). For example. Figure [1] shows 
the nonstationary process considered in Nason et al. (2000), which is the concatenation of 
4 stationary moving average processes, but each with different parameters. We can see that 
within each of the four regimes, the process is weakly stationary, but overall the process is 
non-stationary. 
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Figure 1: Concatenation of four MA time series with different parameters. Overall the process 
is not stationary. The dotted vertical lines indicate the transition between one MA process 
and the next. 



The locally stationary wavelet (LSW) process is a doubly indexed stochastic process, 
defined by 

-1 T-l 

yt= J2 ^ '^jk^j,t~kijk, (1) 

j=-J k=0 

where ^jk is a random orthonormal increment sequence (below this will be iid Gaussian) 



and {il^jk}j.k is a discrete non-decimated family of wavelets for j 



-1, 2, . . . , J , k 



0, . . . ,T — 1, based on a mother wavelet ip{t) of compact support. Denote with lAix) the 
indicator function, i.e. lAi^) = 1, ii x £ A and Ia = 0, otherwise. The simplest class of 
wavelets are the Haar wavelets, defined by 

for j G {—1, —2, . . . , —J} and A: € {. . . , —2, —1,0, 1, 2, . . .}, where j = —1 is the finest scale. It 
is also assumed that E(^jk) = 0, for all j and k and so yt has zero mean. The orthonormality 
assumption of {^jk} implies that Cov{^jk,S,im) = ^jiSkm, where 6jk denotes the Kronecker 
delta, i.e. 6jj = 1 and 6jk = 0, for j / k. 



The parameters Wjk are the amplitudes of the LSW process. The quantity uijk character- 
izes the amount of each oscihation, ipj,t-k at each scale, j, and location, k (modified by the 
random amplitude, £,jk)- For example, a large value of Wjk indicates that there is a chance 
(depending on ^jk) of an oscillation, ipj^t-k, at time t. Nason et al. (2000) control the evolu- 
tion of the statistical characteristics of yt by coupling Wjk to a function Wj{z) for z € (0, 1) 
by Wjk = Wj{k/T) + 0{T~^). Then, the smoothness properties of Wj{z) control the possible 
rate of change of Wjk as a function of k, which consequently controls the evolution of the 
statistical properties of yt- The smoother Wj{z) is, as a function of z, the slower that yt can 
evolve. Ultimately, if Wj{z) is a constant function of z, then yt is weakly stationary. 

The non-stationarity in the above studies is better understood as local-stationarity so that 
the WjkS are close to each other. To elaborate on this, if Wjk = Wj (time invariant), then yt 
would be weakly stationary. The attractiveness of the LSW process, is its ability to consider 
time-changing Wj^s. 

Nason et al. (2000) define the evolutionary wavelet spectrum (EWS) to be Sj{z) = 
\Wj{z)\'^ and discuss methods of estimation. Fryzlewicz et al. (2003) and Fryzlewicz (2005) 
modify the LSW process to forecast log-returns of non-stationary time series. These authors 
analyze daily FTSE 100 time series using the LSW toolbox. Fryzlewicz and Nason (2006) 
estimate the EWS by using a fast Haar-Fisz algorithm. Van Bellegem and von Sachs (2008) 
consider adaptive estimation for the EWS and permit jump discontinuities in the spectrum. 

In this paper we show that the process yt can be decomposed into a sum of signals, each of 
which follows a moving average process with time- varying parameters. We deploy a heuristic 
approach for the estimation of the above moving average process and an example, consisting 
of foreign exchange rates, illustrates the proposed methodology. 

2 Decomposition at scale j 

The LSW process ([TJ can be written as 

-1 

j=-J 
where 

T-l 

Xjt = ^ Wjk1pj,t-kCjk- (3) 

k=0 

For computational simplicity and without loss in generality, we omit the minus sign of the 
scales (— J, . . . , — 1) so that the summation in equation ([2]) is done from j = 1 (scale —1) until 
j = J (scale —J). 



Using Haar wavelets, we can see that at scale 1, we have from ([3|) that xn = ipifiWuiit + 
il^i.~iwi^t-iii,t~i^ since there are only 2 non-zero wavelet coefficients. Then we can re- write ([3]) 
as xit = ai/^it + ai/Ci.t-ii which is a moving average process of order one, with time-varying 
parameters a]^^ and an ■ This process can be referred to as TVMA(l) process. 

In a similar way, for any scale j = 1, . . . , J, we can write 

so that we obtain the TVMA(2'' — 1) process 



i^jt 



4?0* + a^^Cj,t^i + ■■■ + a^;f-'k,,t-2^+i, (4) 



where a-^ = ijjj-iWj^t-i) for all £ = 0, 1, . . . , 2-' — 1 and j = 1, . . . , J. Thus the process yt is 
the sum of J TVMA processes. However, we note that not all time-varying parameters a,/ 
{i = 0,1, ... ,2^ — 1) are independent, since, for a fixed j, they are all functions of the {wjt} 
series. 

We advocate that Wjt is a signal and as such we treat it as an unobserved stochastic 
process. Indeed, from the slow evolution of Wjt, we can postulate that Wjt — Wj,t-i ~ 0, which 
motivates a random walk evolution for Wjt or Wjt = Wj^t-i + Cjt, where Qt is a Gaussian white 
noise, i.e. Cjt '^ -^(0, cr^), for a'j a known variance, and (jt is independent of (kt, for all j 7^ k. 
The magnitude of the differences between Wj^t-i and Wjt can be controlled by cr? and this 
controls on the degree of evolution of Wjt as a function of t and hence on yt through ([2]) . 

At scale 1 we can write xu as 

Xlt = ^PlfiWuClt + V'l -iw^i,t-i6,t-i = (^i,o6t + ^i,-i6,i-i)w^i,t-i + -ipiflCitCit, 
where we have used wu = wi^t-i + Cit- Likewise at scale 2 we have 

3^24 = '4'2fiW2ti2t + i^2~lW2,t-~1^2,t~l + 1p2-2'W2,t~2^2,t~2 + "02 -3'"^2,t-36,t-3 
= (V'2,o6i + 02 -l6,t-l + 02 -26,t-2 + 02 -36,t-3)'W2,t-3 
+V'2,oC2,t-26t + ^2,oC2,t-l6t + ^2,oC2t6t 
+'02,-lC2,t-2C2,t-l + '02,-lC2,t-l'?2,t-l 
+'02,-2C2,t-2C2,t-2, 

where we have used w;2,t-2 = w'2,t-3 + C2,t-2, w^2,t-i = W2,t-3 + C2,t-2 + C2,t-i and W2t = 

W2,t-3 + C2,t-2 + C2,t-1 + C2i- 

In general we observe that at any scale j = 1, . . . , J we can write 

2J-1 2J-2 2J-2 

Xjt = ^ lpj-kCj,t-kWj,t-2J+l + X] X] ^J-kCj,t-kCj,t-m, t = 2^ ,2^ + 1, . . . , (5) 

fc=0 fc=0 m=k 

4 



where the Wjt^s follow the random walk 

3 A state space representation 

For estimation purposes one could use a time-varying moving average model in order to 
estimate {wjk} in ^. Moving average processes with time- varying parameters are useful 
models for locally stationary time series data, but their estimation is more difficult that that 
of time- varying autoregressive processes (Hallin, 1986; Dahlhaus, 1997). The reason for this 
is that the time-dependence of the moving average coefficients may result in identifiability 
problems. The consensus is that some restrictions of the parameter space of the time- varying 
coefficients should be applied; for more details the reader is referred to the above references 
as well as to Triantafyllopoulos and Nason (2007). 

In this section we use a heuristic approach for the estimation of the above models. First 
we recast model ([5])- ([6]) into state space form. To end this we write 

Xjt = Ajtw j^^_2i+i + ^jti (7) 

where Ajt = Y.^k=o i^j-kij^t-k and Vjt = Y1'=o^ TZzI '^j-kij,t-kCj,t-m, for t = 2^ 2^ -M, . . .. 
In addition we assume that ^^ is independent of C^, for i = 1, 2 and for any t, s, so that 

y,t -NWa^Y, V|,-fc(2^' " ^ " 1) > • (8) 

Equations dZ]), ([U]), ([5]) define a state space model for Xjt and by defining At = {Au, . . . , Ajt)' 
and by noting that Vjt is independent of Vkt-, for any j ^ k, we obtain by (l2|) a state space 
model for yt, which essentially is the superposition of J state space models of the form of ([7]), 
dSI), dS]), each being a state space model for each scale j = 1, . . . , J. 

Given a set of data y = {yi, . . . , yx}, a heuristic way to estimate {wjt}, is to simulate in- 
dependently all ^jt from A^(0, 1), thus to obtain simulated values for Ajt and then, conditional 
on At, to apply the Kalman filter to the state space model for yt. This procedure will give 
simulations from the posterior distributions of Wjt and also from the predictive distributions 
of yt+h\y^ ■ The estimator of Wjt and the forecast of yt+h are conditional on the simulated 
values of {Cjt}- For competing simulated sequences {^jt} the performance of the above esti- 
mators/forecasts can be judged by comparing the respective likelihood functions (which are 
easily calculable by the Kalman filter) or by comparing the respective posterior and forecast 
densities (by using sequential Bayes factors). Another means of model performance may be 
the computation of the mean square forecast error. 
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Figure 2: Simulated values of posterior estimates of {Sjt = w'^^}, for {yu} (GBP rate). Shown 
are simulations of {Su} and {821}, corresponding to scales 1 and 2. 



We illustrate this approach by considering foreign exchange rates data. The data are 
collected in daily frequency from 3 January 2006 to and including 31 December 2007 (consid- 
ering trading days there are 501 observations). We consider two exchange rates: US dollar 
with British pound (GBP rate) and US dollar with Euro (EUR rate). After we transform the 
data to the log scale, we propose to use the LSW process in order to obtain estimates of the 
spectrum process {Sjt = w'^^}, for each scale j. We form the vector yt = {yit,y2t)\ where yu 
is the log-return value of GBP and y2t is the log-return value of EUR. For each series {yu} 
and {y2t}> respectively, Figures [2] and [3] show simulations of the posterior spectrum {Sjt}^ 
for scales 1 and 2. The smoothed estimates of these figures are achieved by first computing 
the smoothed estimates using the Kalman filter and then applying a standard Spline method 
(Green and Silverman, 1994). We note that, for the data set considered in this paper, the 
estimates of Figures [2] and [3] are less smooth than those produced by the method of Nason 
et al. (2000). However, a higher degree of smoothness in our estimates can be achieved by 
considering small values of the variance o"?, which controls the smoothness of the shocks in 
the random walk of the lo's. 
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Figure 3: Simulated values of posterior estimates of {Sjt = w'^^}, for {y2t} (EUR rate). Shown 
are simulations of {Su} and {821}, corresponding to scales 1 and 2. 
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